Dataset statistics
| Number of variables | 12 |
|---|---|
| Number of observations | 265471 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 24.3 MiB |
| Average record size in memory | 96.0 B |
Variable types
| Numeric | 12 |
|---|---|
| df_index is highly correlated with ID | High correlation |
|---|---|
| ID is highly correlated with df_index | High correlation |
| u is highly correlated with g and 6 other fields | High correlation |
| g is highly correlated with u and 8 other fields | High correlation |
| r is highly correlated with u and 8 other fields | High correlation |
| i is highly correlated with u and 7 other fields | High correlation |
| z is highly correlated with u and 7 other fields | High correlation |
| uErr is highly correlated with u and 4 other fields | High correlation |
| gErr is highly correlated with u and 8 other fields | High correlation |
| rErr is highly correlated with u and 8 other fields | High correlation |
| iErr is highly correlated with g and 6 other fields | High correlation |
| zErr is highly correlated with g and 6 other fields | High correlation |
| df_index is highly correlated with ID | High correlation |
| ID is highly correlated with df_index | High correlation |
| u is highly correlated with g and 3 other fields | High correlation |
| g is highly correlated with u and 3 other fields | High correlation |
| r is highly correlated with u and 3 other fields | High correlation |
| i is highly correlated with u and 3 other fields | High correlation |
| z is highly correlated with u and 3 other fields | High correlation |
| uErr is highly correlated with gErr and 1 other fields | High correlation |
| gErr is highly correlated with uErr and 1 other fields | High correlation |
| rErr is highly correlated with uErr and 1 other fields | High correlation |
| df_index is highly correlated with ID | High correlation |
| ID is highly correlated with df_index | High correlation |
| u is highly correlated with g and 2 other fields | High correlation |
| g is highly correlated with u and 5 other fields | High correlation |
| r is highly correlated with g and 6 other fields | High correlation |
| i is highly correlated with g and 6 other fields | High correlation |
| z is highly correlated with g and 6 other fields | High correlation |
| uErr is highly correlated with u and 1 other fields | High correlation |
| gErr is highly correlated with u and 7 other fields | High correlation |
| rErr is highly correlated with g and 6 other fields | High correlation |
| iErr is highly correlated with r and 5 other fields | High correlation |
| zErr is highly correlated with r and 4 other fields | High correlation |
| df_index is highly correlated with ID | High correlation |
| ID is highly correlated with df_index | High correlation |
| u is highly correlated with g and 6 other fields | High correlation |
| g is highly correlated with u and 6 other fields | High correlation |
| r is highly correlated with u and 6 other fields | High correlation |
| i is highly correlated with u and 3 other fields | High correlation |
| z is highly correlated with u and 3 other fields | High correlation |
| uErr is highly correlated with u and 6 other fields | High correlation |
| gErr is highly correlated with u and 6 other fields | High correlation |
| rErr is highly correlated with u and 6 other fields | High correlation |
| iErr is highly correlated with uErr and 3 other fields | High correlation |
| zErr is highly correlated with uErr and 3 other fields | High correlation |
| uErr is highly skewed (γ1 = 441.8980897) | Skewed |
| gErr is highly skewed (γ1 = 407.9554013) | Skewed |
| rErr is highly skewed (γ1 = 115.6323666) | Skewed |
| iErr is highly skewed (γ1 = 137.6867747) | Skewed |
| zErr is highly skewed (γ1 = 40.41128375) | Skewed |
| ID has unique values | Unique |
Reproduction
| Analysis started | 2022-02-24 03:57:30.218880 |
|---|---|
| Analysis finished | 2022-02-24 03:58:03.347150 |
| Duration | 33.13 seconds |
| Software version | pandas-profiling v3.1.1 |
| Download configuration | config.json |
df_index
| Distinct | 74950 |
|---|---|
| Distinct (%) | 28.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 36057.1363 |
| Minimum | 0 |
|---|---|
| Maximum | 74949 |
| Zeros | 3 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3704 |
| Q1 | 18433 |
| median | 35819 |
| Q3 | 53240 |
| 95-th percentile | 70011.5 |
| Maximum | 74949 |
| Range | 74949 |
| Interquartile range (IQR) | 34807 |
Descriptive statistics
| Standard deviation | 20796.5678 |
|---|---|
| Coefficient of variation (CV) | 0.5767670406 |
| Kurtosis | -1.109257609 |
| Mean | 36057.1363 |
| Median Absolute Deviation (MAD) | 17404 |
| Skewness | 0.0508676695 |
| Sum | 9572124031 |
| Variance | 432497232.1 |
| Monotonicity | Not monotonic |
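Several figures in the quantile and descriptive tables above are derived from the others, so they can be cross-checked by hand. The following is a minimal sketch of that arithmetic; the variable names are illustrative, and the inputs are copied from the tables as reported (rounded), so the derived values agree only up to rounding.

```python
# Inputs copied from the quantile and descriptive tables above (as reported).
minimum, maximum = 0, 74949
q1, q3 = 18433, 53240
mean_value, std = 36057.1363, 20796.5678

value_range = maximum - minimum  # reported Range: 74949
iqr = q3 - q1                    # reported IQR: 34807
cv = std / mean_value            # reported CV: 0.5767670406
variance = std ** 2              # reported Variance: 432497232.1 (up to rounding)

print(value_range, iqr, round(cv, 6), round(variance))
```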
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 37475 | 4 | < 0.1% |
| 37369 | 4 | < 0.1% |
| 37345 | 4 | < 0.1% |
| 37346 | 4 | < 0.1% |
| 60267 | 4 | < 0.1% |
| 37348 | 4 | < 0.1% |
| 37349 | 4 | < 0.1% |
| 37350 | 4 | < 0.1% |
| 37351 | 4 | < 0.1% |
| 37352 | 4 | < 0.1% |
| Other values (74940) | 265431 |
| Value | Count | Frequency (%) |
| 0 | 3 | |
| 1 | 4 | |
| 2 | 3 | |
| 3 | 4 | |
| 4 | 3 | |
| 5 | 3 | |
| 6 | 3 | |
| 7 | 3 | |
| 8 | 4 | |
| 9 | 4 |
| Value | Count | Frequency (%) |
| 74949 | 1 | |
| 74948 | 1 | |
| 74947 | 1 | |
| 74946 | 1 | |
| 74945 | 1 | |
| 74944 | 1 | |
| 74943 | 1 | |
| 74942 | 1 | |
| 74941 | 1 | |
| 74940 | 1 |
ID
| Distinct | 265471 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.23766412 × 10^18 |
| Minimum | 1.23764588 × 10^18 |
|---|---|
| Maximum | 1.237680531 × 10^18 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 1.23764588 × 10^18 |
|---|---|
| 5-th percentile | 1.23765125 × 10^18 |
| Q1 | 1.237657192 × 10^18 |
| median | 1.237663239 × 10^18 |
| Q3 | 1.237668584 × 10^18 |
| 95-th percentile | 1.237679438 × 10^18 |
| Maximum | 1.237680531 × 10^18 |
| Range | 3.465179365 × 10^13 |
| Interquartile range (IQR) | 1.139238011 × 10^13 |
Descriptive statistics
| Standard deviation | 9.180408119 × 10^12 |
|---|---|
| Coefficient of variation (CV) | 7.417527883 × 10^-6 |
| Kurtosis | -0.8589275398 |
| Mean | 1.23766412 × 10^18 |
| Median Absolute Deviation (MAD) | 5.651643696 × 10^12 |
| Skewness | 0.3502992022 |
| Sum | 8.972874912 × 10^18 |
| Variance | 8.427989322 × 10^25 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1.23764588 × 10^18 | 1 | < 0.1% |
| 1.237666339 × 10^18 | 1 | < 0.1% |
| 1.237666339 × 10^18 | 1 | < 0.1% |
| 1.237666339 × 10^18 | 1 | < 0.1% |
| 1.237666339 × 10^18 | 1 | < 0.1% |
| 1.237666339 × 10^18 | 1 | < 0.1% |
| 1.237666339 × 10^18 | 1 | < 0.1% |
| 1.237666339 × 10^18 | 1 | < 0.1% |
| 1.237666339 × 10^18 | 1 | < 0.1% |
| 1.237666339 × 10^18 | 1 | < 0.1% |
| Other values (265461) | 265461 |
| Value | Count | Frequency (%) |
| 1.23764588 × 10^18 | 1 | |
| 1.23764588 × 10^18 | 1 | |
| 1.23764588 × 10^18 | 1 | |
| 1.237645943 × 10^18 | 1 | |
| 1.237645943 × 10^18 | 1 | |
| 1.237645943 × 10^18 | 1 | |
| 1.237645943 × 10^18 | 1 | |
| 1.237645943 × 10^18 | 1 | |
| 1.237645943 × 10^18 | 1 | |
| 1.237645943 × 10^18 | 1 |
| Value | Count | Frequency (%) |
| 1.237680531 × 10^18 | 1 | |
| 1.237680531 × 10^18 | 1 | |
| 1.237680531 × 10^18 | 1 | |
| 1.237680531 × 10^18 | 1 | |
| 1.237680531 × 10^18 | 1 | |
| 1.237680531 × 10^18 | 1 | |
| 1.237680531 × 10^18 | 1 | |
| 1.237680531 × 10^18 | 1 | |
| 1.237680531 × 10^18 | 1 | |
| 1.237680531 × 10^18 | 1 |
u
| Distinct | 256277 |
|---|---|
| Distinct (%) | 96.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.4897471 |
| Minimum | 6.137899 |
|---|---|
| Maximum | 31.474758 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 6.137899 |
|---|---|
| 5-th percentile | 18.790246 |
| Q1 | 21.248047 |
| median | 22.690958 |
| Q3 | 23.9247045 |
| 95-th percentile | 25.694047 |
| Maximum | 31.474758 |
| Range | 25.336859 |
| Interquartile range (IQR) | 2.6766575 |
Descriptive statistics
| Standard deviation | 2.0892633 |
|---|---|
| Coefficient of variation (CV) | 0.09289847906 |
| Kurtosis | -0.07758833777 |
| Mean | 22.4897471 |
| Median Absolute Deviation (MAD) | 1.314068 |
| Skewness | -0.403543113 |
| Sum | 5970375.652 |
| Variance | 4.365021136 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 23.784828 | 4 | < 0.1% |
| 22.60141 | 4 | < 0.1% |
| 21.96266 | 4 | < 0.1% |
| 23.609325 | 4 | < 0.1% |
| 24.32449 | 3 | < 0.1% |
| 22.059561 | 3 | < 0.1% |
| 23.790031 | 3 | < 0.1% |
| 22.655876 | 3 | < 0.1% |
| 22.11698 | 3 | < 0.1% |
| 22.129829 | 3 | < 0.1% |
| Other values (256267) | 265437 |
| Value | Count | Frequency (%) |
| 6.137899 | 1 | |
| 7.684486 | 1 | |
| 7.85844 | 1 | |
| 8.107904 | 1 | |
| 8.174813 | 1 | |
| 8.223669 | 1 | |
| 9.046374 | 1 | |
| 9.445343 | 1 | |
| 9.599357 | 1 | |
| 9.680221 | 1 |
| Value | Count | Frequency (%) |
| 31.474758 | 1 | |
| 30.669785 | 1 | |
| 30.045591 | 1 | |
| 30.029575 | 1 | |
| 29.914965 | 1 | |
| 29.584764 | 1 | |
| 29.497486 | 1 | |
| 29.22876 | 1 | |
| 29.190369 | 1 | |
| 28.978514 | 1 |
g
| Distinct | 253305 |
|---|---|
| Distinct (%) | 95.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20.9485549 |
| Minimum | 7.446142 |
|---|---|
| Maximum | 32.311321 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 7.446142 |
|---|---|
| 5-th percentile | 17.3354195 |
| Q1 | 19.753766 |
| median | 21.534155 |
| Q3 | 22.319359 |
| 95-th percentile | 23.350527 |
| Maximum | 32.311321 |
| Range | 24.865179 |
| Interquartile range (IQR) | 2.565593 |
Descriptive statistics
| Standard deviation | 1.944626527 |
|---|---|
| Coefficient of variation (CV) | 0.09282867181 |
| Kurtosis | -0.01322663839 |
| Mean | 20.9485549 |
| Median Absolute Deviation (MAD) | 1.02001 |
| Skewness | -0.7648934951 |
| Sum | 5561233.817 |
| Variance | 3.781572331 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 22.362343 | 4 | < 0.1% |
| 22.055502 | 4 | < 0.1% |
| 21.692204 | 4 | < 0.1% |
| 22.892004 | 4 | < 0.1% |
| 18.476742 | 4 | < 0.1% |
| 22.521631 | 4 | < 0.1% |
| 21.936327 | 4 | < 0.1% |
| 22.47476 | 4 | < 0.1% |
| 21.252529 | 3 | < 0.1% |
| 22.010782 | 3 | < 0.1% |
| Other values (253295) | 265433 |
| Value | Count | Frequency (%) |
| 7.446142 | 1 | |
| 8.241127 | 1 | |
| 8.685551 | 1 | |
| 8.854282 | 1 | |
| 8.879968 | 1 | |
| 9.043655 | 1 | |
| 9.541052 | 1 | |
| 9.955412 | 1 | |
| 10.04998 | 1 | |
| 10.204456 | 1 |
| Value | Count | Frequency (%) |
| 32.311321 | 1 | |
| 32.180359 | 1 | |
| 32.180218 | 1 | |
| 31.12199 | 1 | |
| 31.036724 | 1 | |
| 30.37472 | 1 | |
| 30.274063 | 1 | |
| 30.004261 | 1 | |
| 29.916021 | 1 | |
| 29.698931 | 1 |
r
| Distinct | 253330 |
|---|---|
| Distinct (%) | 95.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19.71917763 |
| Minimum | 8.510301 |
|---|---|
| Maximum | 30.481144 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 8.510301 |
|---|---|
| 5-th percentile | 16.5637895 |
| Q1 | 18.49573 |
| median | 20.201982 |
| Q3 | 20.99195 |
| 95-th percentile | 22.0363685 |
| Maximum | 30.481144 |
| Range | 21.970843 |
| Interquartile range (IQR) | 2.49622 |
Descriptive statistics
| Standard deviation | 1.75713942 |
|---|---|
| Coefficient of variation (CV) | 0.0891081491 |
| Kurtosis | -0.1970252936 |
| Mean | 19.71917763 |
| Median Absolute Deviation (MAD) | 1.0718 |
| Skewness | -0.6758215825 |
| Sum | 5234869.805 |
| Variance | 3.087538942 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 20.397943 | 5 | < 0.1% |
| 20.431469 | 5 | < 0.1% |
| 20.158337 | 5 | < 0.1% |
| 20.87731 | 5 | < 0.1% |
| 20.88925 | 4 | < 0.1% |
| 20.369577 | 4 | < 0.1% |
| 20.594721 | 4 | < 0.1% |
| 20.215315 | 4 | < 0.1% |
| 20.970186 | 4 | < 0.1% |
| 21.445936 | 4 | < 0.1% |
| Other values (253320) | 265427 |
| Value | Count | Frequency (%) |
| 8.510301 | 1 | |
| 8.871452 | 1 | |
| 9.381433 | 1 | |
| 9.537403 | 1 | |
| 9.54484 | 1 | |
| 9.806157 | 1 | |
| 9.871026 | 1 | |
| 10.18564 | 1 | |
| 10.685715 | 1 | |
| 10.713271 | 1 |
| Value | Count | Frequency (%) |
| 30.481144 | 1 | |
| 27.92935 | 1 | |
| 27.586512 | 1 | |
| 27.531603 | 1 | |
| 27.443832 | 1 | |
| 27.340242 | 1 | |
| 26.760141 | 1 | |
| 26.303787 | 1 | |
| 26.298737 | 1 | |
| 26.275991 | 1 |
i
| Distinct | 253727 |
|---|---|
| Distinct (%) | 95.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19.1421622 |
| Minimum | 9.260902 |
|---|---|
| Maximum | 32.286316 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 9.260902 |
|---|---|
| 5-th percentile | 16.1717205 |
| Q1 | 17.974121 |
| median | 19.415615 |
| Q3 | 20.274704 |
| 95-th percentile | 21.7420865 |
| Maximum | 32.286316 |
| Range | 23.025414 |
| Interquartile range (IQR) | 2.300583 |
Descriptive statistics
| Standard deviation | 1.729917104 |
|---|---|
| Coefficient of variation (CV) | 0.09037208472 |
| Kurtosis | 0.03961488273 |
| Mean | 19.1421622 |
| Median Absolute Deviation (MAD) | 1.0865 |
| Skewness | -0.3474433238 |
| Sum | 5081688.942 |
| Variance | 2.992613187 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 19.47208 | 4 | < 0.1% |
| 19.364578 | 4 | < 0.1% |
| 19.702448 | 4 | < 0.1% |
| 19.574993 | 4 | < 0.1% |
| 19.677233 | 4 | < 0.1% |
| 19.386427 | 4 | < 0.1% |
| 19.962057 | 4 | < 0.1% |
| 19.768839 | 4 | < 0.1% |
| 19.504345 | 4 | < 0.1% |
| 19.263306 | 4 | < 0.1% |
| Other values (253717) | 265431 |
| Value | Count | Frequency (%) |
| 9.260902 | 1 | |
| 9.45409 | 1 | |
| 9.481367 | 1 | |
| 9.785824 | 1 | |
| 10.00429 | 1 | |
| 10.210069 | 1 | |
| 10.499023 | 1 | |
| 10.601875 | 1 | |
| 10.667053 | 1 | |
| 11.098452 | 1 |
| Value | Count | Frequency (%) |
| 32.286316 | 1 | |
| 31.575436 | 1 | |
| 31.350872 | 1 | |
| 31.254124 | 1 | |
| 31.156153 | 1 | |
| 31.13089 | 1 | |
| 31.073492 | 1 | |
| 30.873112 | 1 | |
| 30.720078 | 1 | |
| 30.709589 | 1 |
z
| Distinct | 253767 |
|---|---|
| Distinct (%) | 95.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.81974907 |
| Minimum | 9.688597 |
|---|---|
| Maximum | 29.146568 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 9.688597 |
|---|---|
| 5-th percentile | 15.876671 |
| Q1 | 17.6424865 |
| median | 19.002357 |
| Q3 | 19.89907 |
| 95-th percentile | 21.6314295 |
| Maximum | 29.146568 |
| Range | 19.457971 |
| Interquartile range (IQR) | 2.2565835 |
Descriptive statistics
| Standard deviation | 1.757823963 |
|---|---|
| Coefficient of variation (CV) | 0.09340315623 |
| Kurtosis | 0.04937809918 |
| Mean | 18.81974907 |
| Median Absolute Deviation (MAD) | 1.082673 |
| Skewness | -0.1689879989 |
| Sum | 4996097.605 |
| Variance | 3.089945084 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 19.214474 | 4 | < 0.1% |
| 19.518696 | 4 | < 0.1% |
| 19.154552 | 4 | < 0.1% |
| 19.348778 | 4 | < 0.1% |
| 19.279848 | 4 | < 0.1% |
| 19.196115 | 4 | < 0.1% |
| 18.947657 | 4 | < 0.1% |
| 18.858974 | 4 | < 0.1% |
| 18.792511 | 4 | < 0.1% |
| 19.009991 | 4 | < 0.1% |
| Other values (253757) | 265431 |
| Value | Count | Frequency (%) |
| 9.688597 | 1 | |
| 10.111669 | 1 | |
| 10.138 | 1 | |
| 10.246193 | 1 | |
| 10.44339 | 1 | |
| 10.667935 | 1 | |
| 10.677568 | 1 | |
| 10.737098 | 1 | |
| 10.83986 | 1 | |
| 10.881444 | 1 |
| Value | Count | Frequency (%) |
| 29.146568 | 1 | |
| 29.105049 | 1 | |
| 28.954861 | 1 | |
| 28.860327 | 1 | |
| 28.741753 | 1 | |
| 28.71353 | 1 | |
| 28.68885 | 1 | |
| 28.670599 | 1 | |
| 28.620174 | 1 | |
| 28.615345 | 1 |
uErr
| Distinct | 232977 |
|---|---|
| Distinct (%) | 87.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.506739932 |
| Minimum | 0.011919 |
|---|---|
| Maximum | 973.115381 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 0.011919 |
|---|---|
| 5-th percentile | 0.0407055 |
| Q1 | 0.1616275 |
| median | 0.421703 |
| Q3 | 0.738313 |
| 95-th percentile | 1.2357135 |
| Maximum | 973.115381 |
| Range | 973.103462 |
| Interquartile range (IQR) | 0.5766855 |
Descriptive statistics
| Standard deviation | 1.992068476 |
|---|---|
| Coefficient of variation (CV) | 3.931145644 |
| Kurtosis | 214288.0944 |
| Mean | 0.506739932 |
| Median Absolute Deviation (MAD) | 0.284124 |
| Skewness | 441.8980897 |
| Sum | 134524.7565 |
| Variance | 3.968336815 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.04659 | 6 | < 0.1% |
| 0.094035 | 6 | < 0.1% |
| 0.068175 | 6 | < 0.1% |
| 0.063306 | 6 | < 0.1% |
| 0.039419 | 6 | < 0.1% |
| 0.098549 | 6 | < 0.1% |
| 0.071352 | 5 | < 0.1% |
| 0.064955 | 5 | < 0.1% |
| 0.053302 | 5 | < 0.1% |
| 0.071216 | 5 | < 0.1% |
| Other values (232967) | 265415 |
| Value | Count | Frequency (%) |
| 0.011919 | 1 | |
| 0.012259 | 1 | |
| 0.012604 | 1 | |
| 0.013049 | 1 | |
| 0.013249 | 1 | |
| 0.013527 | 1 | |
| 0.013617 | 1 | |
| 0.013656 | 1 | |
| 0.013725 | 1 | |
| 0.01378 | 1 |
| Value | Count | Frequency (%) |
| 973.115381 | 1 | |
| 163.304649 | 1 | |
| 115.003115 | 1 | |
| 86.184253 | 1 | |
| 70.904198 | 1 | |
| 61.491178 | 1 | |
| 53.891129 | 1 | |
| 40.433575 | 1 | |
| 22.030731 | 1 | |
| 17.775392 | 1 |
gErr
| Distinct | 171592 |
|---|---|
| Distinct (%) | 64.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1677167151 |
| Minimum | 0.021987 |
|---|---|
| Maximum | 708.703847 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 0.021987 |
|---|---|
| 5-th percentile | 0.030645 |
| Q1 | 0.0527855 |
| median | 0.124564 |
| Q3 | 0.20737 |
| 95-th percentile | 0.428923 |
| Maximum | 708.703847 |
| Range | 708.68186 |
| Interquartile range (IQR) | 0.1545845 |
Descriptive statistics
| Standard deviation | 1.506463924 |
|---|---|
| Coefficient of variation (CV) | 8.982193114 |
| Kurtosis | 186335.3772 |
| Mean | 0.1677167151 |
| Median Absolute Deviation (MAD) | 0.075263 |
| Skewness | 407.9554013 |
| Sum | 44523.92408 |
| Variance | 2.269433553 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.033108 | 13 | < 0.1% |
| 0.03242 | 12 | < 0.1% |
| 0.037837 | 11 | < 0.1% |
| 0.034676 | 11 | < 0.1% |
| 0.033014 | 11 | < 0.1% |
| 0.038172 | 11 | < 0.1% |
| 0.033157 | 11 | < 0.1% |
| 0.030374 | 11 | < 0.1% |
| 0.034204 | 11 | < 0.1% |
| 0.03348 | 11 | < 0.1% |
| Other values (171582) | 265358 |
| Value | Count | Frequency (%) |
| 0.021987 | 1 | |
| 0.022127 | 1 | |
| 0.022439 | 1 | |
| 0.022488 | 1 | |
| 0.022552 | 1 | |
| 0.022594 | 1 | |
| 0.022599 | 1 | |
| 0.022654 | 1 | |
| 0.022695 | 1 | |
| 0.022697 | 1 |
| Value | Count | Frequency (%) |
| 708.703847 | 1 | |
| 220.567456 | 1 | |
| 121.828836 | 1 | |
| 90.426192 | 1 | |
| 83.855285 | 1 | |
| 77.903199 | 1 | |
| 43.265003 | 1 | |
| 37.209066 | 1 | |
| 36.091322 | 1 | |
| 32.901323 | 1 |
rErr
| Distinct | 143132 |
|---|---|
| Distinct (%) | 53.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1199264538 |
| Minimum | 0.034156 |
|---|---|
| Maximum | 39.832404 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 0.034156 |
|---|---|
| 5-th percentile | 0.0458615 |
| Q1 | 0.0625315 |
| median | 0.100181 |
| Q3 | 0.1510175 |
| 95-th percentile | 0.262194 |
| Maximum | 39.832404 |
| Range | 39.798248 |
| Interquartile range (IQR) | 0.088486 |
Descriptive statistics
| Standard deviation | 0.1420781739 |
|---|---|
| Coefficient of variation (CV) | 1.184710874 |
| Kurtosis | 26445.80792 |
| Mean | 0.1199264538 |
| Median Absolute Deviation (MAD) | 0.041204 |
| Skewness | 115.6323666 |
| Sum | 31836.9956 |
| Variance | 0.0201862075 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.051044 | 13 | < 0.1% |
| 0.044166 | 12 | < 0.1% |
| 0.055276 | 12 | < 0.1% |
| 0.05279 | 12 | < 0.1% |
| 0.050936 | 11 | < 0.1% |
| 0.056868 | 11 | < 0.1% |
| 0.056232 | 11 | < 0.1% |
| 0.050227 | 11 | < 0.1% |
| 0.053529 | 11 | < 0.1% |
| 0.046386 | 11 | < 0.1% |
| Other values (143122) | 265356 |
| Value | Count | Frequency (%) |
| 0.034156 | 1 | |
| 0.034476 | 1 | |
| 0.034644 | 1 | |
| 0.034697 | 1 | |
| 0.034901 | 1 | |
| 0.035019 | 1 | |
| 0.035119 | 1 | |
| 0.035207 | 1 | |
| 0.035265 | 1 | |
| 0.035289 | 1 |
| Value | Count | Frequency (%) |
| 39.832404 | 1 | |
| 22.327885 | 1 | |
| 12.858185 | 1 | |
| 12.272254 | 1 | |
| 12.186537 | 1 | |
| 10.61393 | 1 | |
| 9.502449 | 1 | |
| 8.952156 | 1 | |
| 8.897392 | 1 | |
| 8.373005 | 1 |
iErr
| Distinct | 136364 |
|---|---|
| Distinct (%) | 51.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1328689264 |
| Minimum | 0.033318 |
|---|---|
| Maximum | 66.143307 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 0.033318 |
|---|---|
| 5-th percentile | 0.0575865 |
| Q1 | 0.0750465 |
| median | 0.10011 |
| Q3 | 0.143197 |
| 95-th percentile | 0.306796 |
| Maximum | 66.143307 |
| Range | 66.109989 |
| Interquartile range (IQR) | 0.0681505 |
Descriptive statistics
| Standard deviation | 0.2164796617 |
|---|---|
| Coefficient of variation (CV) | 1.629272302 |
| Kurtosis | 35650.32194 |
| Mean | 0.1328689264 |
| Median Absolute Deviation (MAD) | 0.029815 |
| Skewness | 137.6867747 |
| Sum | 35272.84677 |
| Variance | 0.04686344391 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.076775 | 12 | < 0.1% |
| 0.070139 | 11 | < 0.1% |
| 0.076402 | 11 | < 0.1% |
| 0.071658 | 11 | < 0.1% |
| 0.077603 | 11 | < 0.1% |
| 0.075189 | 11 | < 0.1% |
| 0.075621 | 11 | < 0.1% |
| 0.060718 | 11 | < 0.1% |
| 0.068524 | 11 | < 0.1% |
| 0.082396 | 10 | < 0.1% |
| Other values (136354) | 265361 |
| Value | Count | Frequency (%) |
| 0.033318 | 1 | |
| 0.039556 | 1 | |
| 0.039569 | 1 | |
| 0.04054 | 1 | |
| 0.041432 | 1 | |
| 0.041443 | 1 | |
| 0.041504 | 1 | |
| 0.041734 | 1 | |
| 0.041752 | 1 | |
| 0.042023 | 1 |
| Value | Count | Frequency (%) |
| 66.143307 | 1 | |
| 32.41625 | 1 | |
| 20.440717 | 1 | |
| 20.024466 | 1 | |
| 18.334789 | 1 | |
| 17.491723 | 1 | |
| 13.247561 | 1 | |
| 11.959482 | 1 | |
| 11.846359 | 1 | |
| 11.37197 | 1 |
zErr
| Distinct | 171161 |
|---|---|
| Distinct (%) | 64.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2216285618 |
| Minimum | 0.044069 |
|---|---|
| Maximum | 47.529248 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 0.044069 |
|---|---|
| 5-th percentile | 0.079096 |
| Q1 | 0.109228 |
| median | 0.148065 |
| Q3 | 0.2278285 |
| 95-th percentile | 0.648622 |
| Maximum | 47.529248 |
| Range | 47.485179 |
| Interquartile range (IQR) | 0.1186005 |
Descriptive statistics
| Standard deviation | 0.2936028413 |
|---|---|
| Coefficient of variation (CV) | 1.324751823 |
| Kurtosis | 4242.307003 |
| Mean | 0.2216285618 |
| Median Absolute Deviation (MAD) | 0.048007 |
| Skewness | 40.41128375 |
| Sum | 58835.95594 |
| Variance | 0.08620262843 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.115216 | 9 | < 0.1% |
| 0.108653 | 9 | < 0.1% |
| 0.114094 | 8 | < 0.1% |
| 0.113751 | 8 | < 0.1% |
| 0.088678 | 8 | < 0.1% |
| 0.10685 | 8 | < 0.1% |
| 0.088793 | 8 | < 0.1% |
| 0.128688 | 8 | < 0.1% |
| 0.138379 | 8 | < 0.1% |
| 0.102622 | 8 | < 0.1% |
| Other values (171151) | 265389 |
| Value | Count | Frequency (%) |
| 0.044069 | 1 | |
| 0.044153 | 1 | |
| 0.044297 | 1 | |
| 0.044667 | 1 | |
| 0.045021 | 1 | |
| 0.045421 | 1 | |
| 0.046718 | 1 | |
| 0.047086 | 1 | |
| 0.04752 | 1 | |
| 0.047619 | 1 |
| Value | Count | Frequency (%) |
| 47.529248 | 1 | |
| 29.875302 | 1 | |
| 28.713682 | 1 | |
| 26.365085 | 1 | |
| 23.711179 | 1 | |
| 22.242674 | 1 | |
| 19.015472 | 1 | |
| 17.928994 | 1 | |
| 17.246654 | 1 | |
| 16.943693 | 1 |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
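The calculation described here can be sketched in plain Python: rank both variables, then divide the covariance of the ranks by the product of their standard deviations. This is an illustrative sketch, not the code the profiling tool runs; the helper names `average_ranks` and `spearman_rho` are hypothetical, and ties are assumed to receive the average of the positions they span.

```python
from statistics import mean


def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda k: values[k])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next sorted value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = shared
        start = end + 1
    return ranks


def spearman_rho(x, y):
    """Covariance of the rank variables over the product of their standard deviations."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    # The 1/n factors in the covariance and standard deviations cancel out.
    return cov / (var_x * var_y) ** 0.5


# A nonlinear but strictly increasing relation still gives rho close to 1.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
print(spearman_rho(x, y))
```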
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
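As a rough illustration of the formula and of the invariance property, the sketch below computes r directly and then again after shifting and rescaling each variable separately. The function name `pearson_r` and the sample data are made up for the example; the scale factors are assumed positive, since a negative factor would flip the sign of r.

```python
from statistics import mean


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    # The 1/n factors in the covariance and standard deviations cancel out.
    return cov / (var_x * var_y) ** 0.5


# Illustrative data: y is roughly linear in x.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 8.1, 9.8]

r_original = pearson_r(x, y)
# Change location and scale of each variable separately (positive factors).
r_transformed = pearson_r([10 * a + 3 for a in x], [0.5 * b - 7 for b in y])

print(r_original, r_transformed)  # the two values agree up to floating-point error
```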
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
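The pair-counting definition given here corresponds to the variant usually called tau-a. A minimal sketch follows, with the hypothetical name `kendall_tau_a`; it assumes tied pairs count as neither concordant nor discordant. Library implementations often apply a tie correction (tau-b), which this sketch does not attempt, so results can differ on data with ties.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1  # both variables order the pair the same way
        elif direction < 0:
            discordant += 1  # the variables order the pair in opposite ways
        # direction == 0 means a tie in x or y; it is counted in neither group.
    n = len(x)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# One swapped neighbour out of six pairs: 5 concordant, 1 discordant.
print(kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4]))
```

The double loop over all pairs is quadratic in the number of observations, which is fine for a small illustration but slow for a table with hundreds of thousands of rows.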
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available for Phik.
A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
First rows
| | df_index | ID | u | g | r | i | z | uErr | gErr | rErr | iErr | zErr |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 1237645879562928805 | 25.155375 | 22.232981 | 21.257841 | 19.889854 | 19.427107 | 1.034657 | 0.219033 | 0.206952 | 0.147347 | 0.228686 |
| 1 | 1 | 1237645942905504040 | 21.910667 | 19.464439 | 18.372978 | 17.955502 | 17.642979 | 0.443852 | 0.060666 | 0.064809 | 0.069981 | 0.096798 |
| 2 | 2 | 1237645942905635180 | 20.978670 | 20.276821 | 19.472305 | 19.232706 | 18.994177 | 0.150120 | 0.070810 | 0.076891 | 0.085570 | 0.132459 |
| 3 | 3 | 1237645942906224860 | 20.389980 | 19.313643 | 17.891615 | 17.415066 | 17.181231 | 0.331636 | 0.113295 | 0.092397 | 0.093178 | 0.127449 |
| 4 | 4 | 1237645943978721520 | 23.907169 | 23.448675 | 20.256517 | 19.163860 | 18.557152 | 1.527960 | 1.033415 | 0.177488 | 0.132869 | 0.149485 |
| 5 | 5 | 1237645943978787138 | 20.503014 | 19.280542 | 18.830477 | 18.559280 | 18.481827 | 0.120214 | 0.044740 | 0.061519 | 0.071990 | 0.098771 |
| 6 | 6 | 1237645943978787212 | 21.728035 | 19.977358 | 18.901865 | 18.379099 | 18.037195 | 0.315895 | 0.061951 | 0.063594 | 0.068638 | 0.084922 |
| 7 | 7 | 1237645943978787238 | 20.095301 | 18.808825 | 18.192244 | 17.883970 | 17.655457 | 0.139342 | 0.054625 | 0.073051 | 0.085504 | 0.116468 |
| 8 | 8 | 1237645943978852526 | 19.983652 | 18.805828 | 17.918039 | 17.496410 | 17.238651 | 0.092822 | 0.042239 | 0.054085 | 0.062537 | 0.077156 |
| 9 | 9 | 1237645943978852865 | 24.602087 | 21.488087 | 19.520910 | 18.921425 | 18.594252 | 0.866716 | 0.179641 | 0.090975 | 0.091485 | 0.116313 |
Last rows
| | df_index | ID | u | g | r | i | z | uErr | gErr | rErr | iErr | zErr |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 265461 | 74631 | 1237679440425582734 | 25.917360 | 22.450760 | 21.419075 | 20.998936 | 21.390163 | 0.569180 | 0.188862 | 0.163015 | 0.178872 | 0.480358 |
| 265462 | 74632 | 1237679440425582965 | 23.639347 | 22.198999 | 21.031853 | 20.623516 | 20.220276 | 1.164313 | 0.207720 | 0.168991 | 0.189767 | 0.281953 |
| 265463 | 74633 | 1237679440425582993 | 25.019598 | 23.568682 | 21.386972 | 20.716246 | 20.140373 | 1.166885 | 0.510955 | 0.181036 | 0.170606 | 0.219201 |
| 265464 | 74635 | 1237679440425583065 | 24.128036 | 22.615719 | 22.088884 | 22.170218 | 22.780581 | 0.840129 | 0.162339 | 0.193105 | 0.295670 | 0.616266 |
| 265465 | 74636 | 1237679440425583321 | 24.797550 | 24.092091 | 22.128248 | 21.591732 | 21.169786 | 0.827333 | 0.469421 | 0.201903 | 0.198841 | 0.295564 |
| 265466 | 74637 | 1237679440425583329 | 24.976448 | 24.600201 | 22.031639 | 21.277906 | 21.742359 | 0.993866 | 0.745757 | 0.237257 | 0.203570 | 0.556340 |
| 265467 | 74638 | 1237679440425583352 | 22.115126 | 22.939453 | 21.892546 | 21.643509 | 21.161491 | 0.297924 | 0.287982 | 0.238297 | 0.287231 | 0.418816 |
| 265468 | 74639 | 1237679440425583393 | 25.053223 | 23.429272 | 21.970121 | 21.796354 | 21.764332 | 0.960138 | 0.374865 | 0.223337 | 0.289326 | 0.562293 |
| 265469 | 74640 | 1237679440425583406 | 23.518948 | 23.862247 | 22.476570 | 23.323412 | 22.473553 | 0.665053 | 0.426565 | 0.271013 | 0.674657 | 0.628833 |
| 265470 | 74641 | 1237679440425583552 | 23.850555 | 23.662268 | 22.625984 | 21.673103 | 20.727907 | 0.995684 | 0.475650 | 0.380495 | 0.279438 | 0.296898 |